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Abstract. The Directed Maximum Leaf Out-Branching problem 
is to find an out-branching (i.e. a rooted oriented spanning tree) in a given 
digraph with the maximum number of leaves. In this paper, we obtain 
two combinatorial results on the number of leaves in out-branchings. We 
show that 

— every strongly connected n-vertex digraph D with minimum in- 
degree at least 3 has an out-branching with at least (n/4) 1//;i — 1 
leaves; 

— if a strongly connected digraph D does not contain an out-branching 
with k leaves, then the pathwidth of its underlying graph UG(D) is 
O(fclogfc). Moreover, if the digraph is acyclic, the pathwidth is at 
most 4k. 

The last result implies that it can be decided in time 2° (fclog2 k) ■ n° (1) 
whether a strongly connected digraph on n vertices has an out-branching 
with at least k leaves. On acyclic digraphs the running time of our algo- 
rithmis2°( fcl °e fc > ■ . 



1 Introduction 

In this paper, we initiate the combinatorial and algorithmic study of a natural 
generalization of the well studied Maximum Leaf Spanning Tree (MLST) 
problem on connected undirected graphs [11,16,19-21,24,26,32,34]. Given a 
digraph D, a subdigraph T of D is an out-tree if T is an oriented tree with 
only one vertex s of in-degree zero (called the root). If T is a spanning out- 
tree, i.e. V(T) — V(D), then T is called an out-branching of D. The vertices of 
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T of out-degree zero are called leaves. The Directed Maximum Leaf Out- 
Branching (DMLOB) problem is to find an out-branching in a given digraph 
with the maximum number of leaves. 

It is well-known that MLST is NP-hard for undirected graphs [25], which 
means that DMLOB is NP-hard for symmetric digraphs (i.e., digraphs in which 
the existence of an arc xy implies the existence of the arc yx) and, thus, for 
strongly connected digraphs. We can show that DMLOB is NP-hard for acyclic 
digraphs as follows: Consider a bipartite graph G with bipartition X, Y and a 
vertex s £ V(G). To obtain an acyclic digraph D from G and s, orient the edges 
of G from X to Y and add all arcs sx, x e X. Let B be an out-branching in D. 
Then the set of leaves of B is Y U X', where X' C X, and for each y e Y there 
is a vertex z e Z = X\X' such that zy e A(D). Observe that B has maximum 
number of leaves if and only if Z C X is of minimum size among all sets Z' C X 
such that Na(Z') = X. However, the problem of finding Z' of minimum size 
such that Nq(Z') = X is equivalent to the Set Cover problem {{Ng{v)\ y e Y} 
is the family of sets to cover), which is NP-hard. 

The combinatorial study of spanning trees with maximum number of leaves 
in undirected graphs has an extensive history. Linial conjectured around 1987 
that every connected graph on n vertices with minimum vertex degree S has a 
spanning tree with at least n(S — 2)/(<5 + 1) + c$ leaves, where cs depends on 
5. This is indeed the case for all <5 < 5. Kleitman and West [28] and Linial and 
Sturtevant [31] showed that every connected undirected graph G on n vertices 
with minimum degree at least 3 has a spanning tree with at least n/4 + 2 leaves. 
Griggs and Wu [26] proved that the maximum number of leaves in a spanning 
tree is at least n/2 + 2 when (5 = 5 and at least 2n/5 + 8/5 when (5 = 4. All these 
results are tight. The situation is less clear for 5 > 6; the first author observed 
that Linial's conjecture is false for all large values of S. Indeed, the results in [2] 
imply that there are undirected graphs with n vertices and minimum degree (5 in 
which no tree has more than (1 — (1 + o(l)) '"j^ 1 ^ )n leaves, where the o(l)-term 
tends to zero an 5 tends to infinity, and this is essentially tight. See also [3], pp. 
4-5 and [13] for more information. 

In this paper we prove an analogue of the Kleitman- West result for directed 
graphs: every strongly connected digraph D of order n with minimum in-degree 
at least 3 has an out-branching with at least (n/4) 1 / 3 — 1 leaves. We do not know 
whether this bound is tight, however we show that there are strongly connected 
digraphs with minimum in-degree 3 in which every out-branching has at most 
0(y/n) leaves. 

Unlike its undirected counterpart which has attracted a lot of attention in all 
algorithmic paradigms like approximation algorithms [24,32,34], parameterized 
algorithms [11, 19,21], exact exponential time algorithms [20] and also combina- 
torial studies [16,26,28,31], the Directed Maximum Leaf Out-Branching 
problem has largely been neglected until recently. The only paper we are aware 
of is the very recent paper [18] that describes an 0(v / OPT)-approximation algo- 
rithms for DMLOB. 
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Our second combinatorial result relates the number of leaves in a DMLOB 
of a directed graph D with the pathwidth of its underlying graph UG(D). (We 
postpone the definition of pathwidth till the next section.) If an undirected graph 
G contains a star as a minor, then it is possible to construct a spanning 
tree with at least k leaves from this minor. Otherwise, there is no minor 
in G, and it is possible to prove that the pathwidth of G is O(k). (See, e.g. [8].) 
Actually, a much more general result due to Bienstock et al. [10]) is that any 
undirected graph of pathwidth at least k, contains all trees on k vertices as a 
minor. We prove a result that can be viewed as a generalization of known bounds 
on the number of leaves in a spanning tree of an undirected graph in terms of 
its pathwidth, to strongly connected digraphs. We show that either a strongly 
connected digraph D has a DMLOB with at least k leaves or the pathwidth of 
UG(D) is 0(k\ogk). For an acyclic digraph with a DMLOB having k leaves, 
we prove that the pathwidth is at most 4k. This almost matches the bound 
for undirected graphs. These combinatorial results are useful in the design of 
parameterized algorithms. 

In parameterized algorithms, for decision problems with input size n, and a 
parameter k, the goal is to design an algorithm with runtime f{k)n°^ 1 , where 
/ is a function of k alone. (For DMLOB such a parameter is the number of 
leaves in the out-tree.) Problems having such an algorithm are said to be fixed 
parameter tractable (FPT). The book by Downey and Fellows [17] provides an 
introduction to the topic of parameterized complexity. For recent developments 
see the books by Flum and Grohe [23] and by Niedermeier [33]. 

The parameterized version of DMLOB is defined as follows: Given a digraph 
D and a positive integral parameter k, does D contain an out-branching with at 
least k leaves? We denote the parameterized versions of DMLOB by fc-DMLOB. 
If in the above definition we do not insist on an out-branching and ask whether 
there exists an out-tree with at least k leaves, we get the parameterized Di- 
rected Maximum Leaf Out-Tree problem (denoted fc-DMLOT). 

Our combinatorial bounds, combined with dynamic programming on graphs 
of bounded pathwidth imply the first parameterized algorithms for fc-DMLOB on 
strongly connected digraphs and acyclic digraphs. We remark that the algorith- 
mic results presented here also hold for all digraphs if we consider fc-DMLOT 
rather than fc-DMLOB. This answers an open question of Mike Fellows [14, 
22,27]. However, we mainly restrict ourselves to fc-DMLOB for clarity and the 
harder challenges it poses, and we briefly consider fc-DMLOT only in the last 
section. 

Very recently, using a modification of our approach, Bonsma and Dorn [12] 
proved that cither an arbitrary digraph D has an out-branching with at most fc 
leaves or the pathwidth of UG(D') is 0(fc 3 ), where D' is the digraph obtained 
from D by deleting all arcs not contained in any out-branching of D. The bound 
0(fc 3 ) is much larger than our bounds for strongly connected and acyclic di- 
graphs, but it suffices to allow Bonsma and Dorn to show that fc-DMLOB is 
FPT, settling another open question of Fellows [22,27]. 
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This paper is organized as follows. In Section 2 we provide additional ter- 
minology and notation as well as some well-known results. We introduce locally 
optimal out-branchings in Section 3. Bounds on the number of leaves in maxi- 
mum leaf out-branchings of strongly connected and acyclic digraphs are obtained 
in Section 4. In Section 5 we prove upper bounds on the pathwidth of the un- 
derlying graph of strongly connected and acyclic digraphs that do not contain 
out-branchings with at least k leaves. In Section 6 we conclude with discussions 
and open problems. 

2 Preliminaries 

Let D be a digraph. By V(D) and A(D) we represent the vertex set and arc 
set of D, respectively. An oriented graph is a digraph with no directed 2-cycle. 
Given a subset V' C V(D) of a digraph D, let -D[V] denote the digraph induced 
by V. The underlying graph XJG(D) of D is obtained from D by omitting all 
orientations of arcs and by deleting one edge from each resulting pair of parallel 
edges. The connectivity components of D are the subdigraphs of D induced by 
the vertices of components of UG(D). A digraph D is strongly connected if, for 
every pair x, y of vertices there are directed paths from x to y and from y to 
x. A maximal strongly connected subdigraph of D is called a strong component. 
A vertex u of D is an in-neighbor [out-neighbor) of a vertex v if uv e A(D) 
(vu e A(D), respectively). The in-degree d~(v) (out-degree d + (v)) of a vertex v 
is the number of its in- neighbors (out-neighbors). 

We denote by 1(D) the maximum number of leaves in an out-tree of a digraph 
D and by £ S (D) we denote the maximum possible number of leaves in an out- 
branching of a digraph D. When D has no out-branching, we write £ S (D) = 0. 
The following simple result gives necessary and sufficient conditions for a digraph 
to have an out-branching. This assertion allows us to check whether £ S (D) > 
in time 0(\V(D)\ + \A(D)\). 

Proposition 1 ([7]). A digraph D has an out-branching if and only if D has a 
unique strong component with no incoming arcs. 

Let P = uiu 2 . . . u q be a directed path in a digraph D. An arc UiUj of D is a 
forward (backward) arc for P if i < j — 2 (j < i, respectively). Every backward 
arc of the type is called double. 

For a natural number n, [n] denotes the set {1, 2, . . . , n}. 

A tree decomposition of an (undirected) graph G is a pair (X, U) where U is 
a tree whose vertices we will call nodes and X = ({Xi | i € V(U)}) is a collection 
of subsets of V(G) such that 

1- {Jiev(u)Xi = V(G), 

2. for each edge {v, w} G E(G), there is an i £ V(U) such that v, w € X i7 and 

3. for each v G V(G) the set of nodes {i | v e Xi} forms a subtree of U. 
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The width of a tree decomposition ({JQ | i £ V(U)},U) equals max ieV (u){\^i\ — 
1}. The treewidth of a graph G is the minimum width over all tree decompositions 
of G. 

If in the definitions of a tree decomposition and treewidth we restrict U to be 
a path, then we have the definitions of path decomposition and pathwidth. We 
use the notation tw(G) and pw(G) to denote the treewidth and the pathwidth 
of a graph G. 

We also need an equivalent definition of pathwidth in terms of vertex sepa- 
rators with respect to a linear ordering of the vertices. Let G be a graph and let 
a = (vi,v 2 , ■ ■ ■ , v n ) be an ordering of V(G). For j e [n] put Vj = {vi : i e [j]} 
and denote by dVj all vertices of Vj that have neighbors in V \Vj. Setting 
vs(G, a) — max iC [„] |9Vi|, we define the vertex separation of G as 

vs(G) — min{vs(G, a): a is an ordering of V(G)}. 

The following assertion is well-known. It follows directly from the results of 
Kirousis and Papadimitriou [30] on interval width of a graph, see also [29] . 

Proposition 2 ([29,30]). For any graph G, vs(G) = pw(G). 
3 Locally Optimal Out-Branchings 

Our bounds are based on finding locally optimal out-branchings. Given a di- 
graph, D and an out-branching T, we call a vertex leaf, link and branch if its 
out-degree in T is 0, 1 and > 2 respectively Let S> 2 (T) be the set of branch 
vertices, Si(T) the set of link vertices and L(T) the set of leaves in the tree T. 
Let &2{T) be the set of maximal paths consisting of link vertices. By p(v) we 
denote the parent of a vertex v in T; p(v) is the unique in-neighbor of v. We call 
a pair of vertices u and v siblings if they do not belong to the same path from 
the root r in T. We start with the following well known and easy to observe 
facts. 

Fact 1 \St 2 {T)\<\L{T)\-l. 

Fact 2 \&i{T)\ < 2\L(T)\ - 1. 

Now we define the notion of local exchange which is intensively used in our 
proofs. 

Definition 3 ^-Arc Exchange (£-AE) optimal out-branching: An out- 
branching T of a directed graph D with k leaves is l-AE optimal if for all arc 
subsets F C A(T) and X C A(D) - A(T) of size I, (A(T) \ F) U X is either not 
an out-branching, or an out-branching with at most k leaves. In other words, T 
is l-AE optimal if it can't be turned into an out-branching with more leaves by 
exchanging I arcs. 

Let us remark, that for every fixed I, an ^-AE optimal out-branching can be 
obtained in polynomial time. In our proofs we use only l-AE optimal out- 
branchings. We need the following simple properties of l-AE optimal out-branchings. 
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Lemma 1. Let T be an 1-AE optimal out-branching rooted at r in a digraph D. 
Then the following holds: 

(a) For every pair of siblings m,d6 V(T) \ L with d^{p{v)) = 1, there is no arc 
e = (u,v) e A(D)\A(T); 

(b) For every pair of vertices u,v $ L, d^(p(v)) = 1, which are on the same 
path from the root with dist(r, u) < dist(r, v) there is no arc e = (u, v) <G 
A(D) \ A(T) (here dist{r, u) is the distance to u in T from the root r); 

(c) There is no arc (v,r), v ^ L such that the directed cycle formed by the 
(r,v)-path and the arc (v,r) contains a vertex x such that d^,{p(x)) = 1. 

Proof. The proof easily follows from the fact that the existence of any of these 
arcs contradicts the local optimality of T with respect to 1-AE. □ 

4 Combinatorial Bounds 

We start with a lemma that allows us to obtain lower bounds on £ S (D). 

Lemma 2. Let D be a oriented graph of order n in which every vertex is of 
in-degree 2 and let D have an out-branching. If D has no out-tree with k leaves, 
then n < 4k 3 . 

Proof. Let us assume that D has no out-tree with k leaves. Consider an out- 
branching T of D with p < k leaves which is 1-AE optimal. Let r be the root of 
T. 

We will bound the number n of vertices in T as follows. Every vertex of T is 
cither a leaf, or a branch vertex, or a link vertex. By Facts 1 and 2 we already 
have bounds on the number of leaf and branch vertices as well as the number 
of maximal paths consisting of link vertices. So to get an upper bound on n in 
terms of k, it suffices to bound the length of each maximal path consisting of 
link vertices. Let us consider such a path P and let x,y be the first and last 
vertices of P, respectively. 

The vertices of V(T) \ V(P) can be partitioned into four classes as follows: 

(a) ancestor vertices: the vertices which appear before x on the (r, x)-path of T; 

(b) descendant vertices : the vertices appearing after the vertices of P on paths 
of T starting at r and passing through y; 

(c) sink vertices: the vertices which are leaves but not descendant vertices; 

(d) special vertices: none-of-the-above vertices. 

Let P' = P — x, let z be the out-neighbor of y on T and let T z be the subtree 
of T rooted at z. By Lemma 1, there are no arcs from special or ancestor vertices 
to the path P' . Let uv be an arc of A(D) \ A(P') such that v G V(P'). There 
are two possibilities for u: (i) u V(P'), (ii) u e V(P') and uv is backward for 
P' (there are no forward arcs for P 1 since T is 1-AE optimal). Note that every 
vertex of type (i) is either a descendant vertex or a sink. Observe also that the 
backward arcs for P' form a vertex-disjoint collection of out-trees with roots at 
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vertices that are not terminal vertices of backward arcs for P' . These roots are 
terminal vertices of arcs in which first vertices are descendant vertices or sinks. 

We denote by {u\, u 2 , • ■ • , u s } and {vi,v 2 , . . . ,v t } the sets of vertices on P' 
which have in-neighbors that are descendant vertices and sinks, respectively. Let 
the out-tree formed by backward arcs for P' rooted at w <G {u\, . . . , u s , v\, . . . , v t } 
be denoted by T(w) and let l(w) denote the number of leaves in T(w). Observe 
that the following is an out-tree rooted at z: 

s 

T z U {(in(ui), ui), . . . , (in(u s ), u s )} U |J T( Ui ), 

i=l 

where {in(ui), . . . , in(u s )} are the in-neighbors of {u\, . . . , u s } on T z . This out- 
tree has at least X)i=i K u i) leaves and, thus, J2i=i K u i) < k—1. Let us denote 
the subtree of T rooted at x by T x and let {in(v\), . . . , in(vt)} be the in-neighbors 
of {vi, . . . , v t } on T — V(T X ). Then we have the following out-tree: 

t 

(T - V(T X )) U {(in(«i), wi), . . . , (in(v t ), v t )} U |J T( Vi ) 

i=l 

with at least J^ i=1 l(vi) leaves. Thus, J2l=i K v i) < k — 1. 

Consider a path R — vov± . . . v r formed by backward arcs. Observe that the 
arcs {viVi + i : < i < r — 1} U {vjv^ : 1 < j < r} form an out-tree with 
r leaves, where is the out-neighbor of Vj on P. Thus, there is no path of 
backward arcs of length more than k—1. Every out-tree T{w), w € {u\, . . . , u s } 
has l(w) leaves and, thus, its arcs can be decomposed into l(w) paths, each 
of length at most k—1. Now we can bound the number of arcs in all the trees 
T(w), w G {ui, . . . , u s }, as follows: X^i=i l( u i)(k~ 1) — (k— l) 2 . We can similarly 
bound the number of arcs in all the trees T(w), w G {vi, . . . , v s } by (k — l) 2 . 
Recall that the vertices of P' can be either terminal vertices of backward arcs 
for P 1 or vertices in {u\, . . . , u s , v\, . . . , v t }. Observe that s + t < 2(k — 1) since 

Ei=i < k - 1 and YLi l ( y *) <k-l. 

Thus, the number of vertices in P is bounded from above by 1 + 2(k — 1) + 
2{k- l) 2 . Therefore, 

n = \L(T)\ + \S$ 2 (T)\ + \St(T)\ 

= \l(t)\ + \sUt)\+ \ v ^\ 

< (k - 1) + (k - 2) + {2k - 3)(2k 2 - 2k + 1) 

< 4k 3 . 

Thus, we conclude that n < 4k 3 . □ 

Theorem 4. Let D be a strongly connected digraph with n vertices. 

(a) If D is an oriented graph with minimum in-degree at least 2, then £ S (D) > 
(n/4) 1 /3 _ i. 
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(b) If D is a digraph with minimum in-degree at least 3, thenl s (D) > (n/4) 1 / 3 — 
1. 

Proof. Since D is strongly connected, we have 1(D) — £ S (D) > 0. Let T be an 
1-AE optimal out-branching of D with maximum number of leaves, (a) Delete 
some arcs from A(D) \ A(T), if needed, such that the in-degree of each vertex 
of D becomes 2. Now the inequality £ S (D) > (n/4) 1 / 3 — 1 follows from Lemma 
2 and the fact that 1(D) = £ S (D). 

(b) Let P be the path formed in the proof of Lemma 2. (Note that A(P) C 
A(T).) Delete every double arc of P, in case there are any, and delete some more 
arcs from A(D) \ A(T), if needed, to ensure that the in-degree of each vertex of 
D becomes 2. It is not difficult to sec that the proof of Lemma 2 remains valid 
for the new digraph D. Now the inequality £ S (D) > (n/4) 1 / 3 — 1 follows from 
Lemma 2 and the fact that 1(D) = l s (D). □ 

Remark 5 It is easy to see that Theorem 4 holds also for acyclic digraphs D 
with £ S (D) > 0. 

While we do not know whether the bounds of Theorem 4 are tight, we can 
show that no linear bounds are possible. The following result is formulated for 
Part (b) of Theorem 4, but a similar result holds for Part (a) as well. 

Theorem 6. For each t > 6 there is a strongly connected digraph H t of order 
n = t 2 + 1 with minimum in-degree 3 such that < £ s (H t ) — 0(t). 

Proof. Let V(H t ) = {r} U {u\, u\, . . . , u\ \ i e [*]} and 

A(H t ) = {«j«5 +1 , «5 +1 «j I i G [t],j € {0, 1, . . . ,t - 3}} 

U{«H-2 I * e [t],j e {3,4,...,t-2}} 

\J{u)u\ I te [t],t-3<j^q<t}, 
where u l = r for every i€ [t]. It is easy to check that < £ s (H t ) = 0(t). □ 

5 Pathwidth of underlying graphs and parameterized 
algorithms 

By Proposition 1, an acyclic digraph D has an out-branching if and only if D 
possesses a single vertex of in-degree zero. 

Theorem 7. Let D be an acyclic digraph with a single vertex of in-degree zero. 
Then either I S (D) > k or the underlying undirected graph of D is of pathwidth 
at most Ak and we can obtain this path decomposition in polynomial time. 

Proof. Assume that £ S (D) < k — 1. Consider a 1-AE optimal out-branching T 
of D. Notice that |£(T)| < k — 1. Now remove all the leaves and branch vertices 
from the tree T. The remaining vertices form maximal directed paths consisting 
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of link vertices. Delete the first vertices of all paths. As a result we obtain a 
collection Q of directed paths. Let H = Up e gP. We will show that every arc uv 
with u,v £ V(H) is in H. 

Let P' £ Q. As in the proof of Lemma 2, we see that there are no forward 
arcs for P'. Since D is acyclic, there are no backward arcs for P' . Suppose uv is 
an arc of D such that u £ R' and »£?', where R' and P' are distinct paths from 
Q. As in the proof of Lemma 2, we see that u is either a sink or a descendent 
vertex for P' in T. Since R 1 contains no sinks of T, u is a descendent vertex, 
which is impossible as D is acyclic. Thus, we have proved that pw(XJG(H)) = 1. 

Consider a path decomposition of H of width 1. We can obtain a path de- 
composition of UG(£>) by adding all the vertices of L(T)US^ 2 (T)UF(T), where 
F(T) is the set of first vertices of maximal directed paths consisting of link ver- 
tices of T, to each of the bags of a path decomposition of H of width 1. Observe 
that the pathwidth of this decomposition is bounded from above by 



\L(T)\ + \S+ 2 (T)\ + \F(T)\ + 1 < (k - 1) + (k - 2) + (2k - 3) + 1 < 4fc - 5. 



The bounds on the various sets in the inequality above follows from Facts 1 and 



Corollary 1. For acyclic digraphs, the problem k-DMLOB can solved in time 

2 0(fclogfc) . n O(l) 

Proof. The proof of Theorem 7 can be easily turned into a polynomial time 
algorithm to cither build an out-branching of D with at least k leaves or to show 
that pw(XJG(D)) < 4fc and provide the corresponding path decomposition. A 
standard dynamic programming over the path (tree) decomposition (see e.g. [6]) 
gives us an algorithm of running time 2 ( fclosfc ) • n°^ x \ □ 

The following simple lemma is well known, see, e.g., [15]. 

Lemma 3. Let T = (V, E) be an undirected tree and letw : V — > M + U {0} be a 
weight function on its vertices. There exists a vertex v £ T such that the weight 
of every subtree T' ofT—v is at most w(T)/2, where w(T) = J2veV w ( v )- 

Let D be a strongly connected digraph with £ s (D) = A and let T be an out- 
branching of D with A leaves. Consider the following decomposition of T (called 
a (3- decomposition) which will be useful in the proof of Theorem 8. 

Assign weight 1 to all leaves of T and weight to all non-leaves of T. By 
Lemma 3, T has a vertex v such that each component of T — v has at most 
A/2 + 1 leaves (if v is not the root and its in- neighbor v~ in T is a link vertex, 
then becomes a new leaf). Let Ti,T 2 , . . . ,T S be the components of T — v 
and let Zi, Z25 be the numbers of leaves in the components. Notice that 

A < X)i=i < A + 1 (we may get a new leaf). We may assume that l s < Z s _i < 
■•• <h < A/2 + 1. Let j be the first index such that J2l=i h — I + 1- Consider 
two cases: (a) lj < (A + 2)/4 and (b) lj > (A + 2)/4. In Case (a), we have 



2. This proves the theorem. 



□ 
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In Case (b), we have j = 2 and 



A + 2 
4 



A + 2 
2 



A-2 



S 



3A + 2 
4 



< h < 



and 



2 



<5>< 



i=2 



Let p = j in Case (a) and p = 1 in Case (b). Add to Z? and T a copy i/ of 
v (with the same in- and out- neighbors). Then the number of leaves in each of 
the out-trees 



is between A(l + o(l))/4 and 3A(l + o(l))/4. Observe that the vertices of T" have 
at most A + 1 out-neighbors in T" and the vertices of T" have at most A + 1 
out-neighbors in T" (wc add 1 to A due to the fact that v 'belongs' to both T' 
and T"). 

Similarly to deriving T 1 and T" from T, we can obtain two out-trees from 
T' and two out-trees from T" in which the numbers of leaves are approximately 
between a quarter and three quarters of the number of leaves in T" and T", 
respectively. Observe that after 0(log A) 'dividing' steps, we will end up with 
O(A) out-trees with just one leaf, i.e., directed paths. These paths contain O(A) 
copies of vertices of D (such as v' above). After deleting the copies, we obtain a 
collection of O(A) disjoint directed paths covering V(D). 

Theorem 8. Let D be a strongly connected digraph. Then either i s {D) > k or 
the underlying undirected graph of D is of pathwidth O(fclogfc). 

Proof. We may assume that £ S (D) < k. Let T be be a 1-AE optimal out- 
branching. Consider a /3-decomposition of T. The decomposition process can 
be viewed as a tree T rooted in a node (associated with) T. The children of T in 
T are nodes (associated with) T 1 and T"; the leaves of T are the directed paths 
of the decomposition. The first layer of T is the node T, the second layer are 
T' and T", the third layer are the children of T' and T", etc. In what follows, 
we do not distinguish between a node Q of T and the tree associated with the 
node. Assume that T has t layers. Notice that the last layer consists of (some) 
leaves of T and that t = 0(log k), which was proved above (k < A — 1). 
Let Q be a node of T at layer j. We will prove that 



Since t = O(logfc), (1) for j = 1 implies that the underlying undirected graph 
of D is of pathwidth O(fclogfc). 

We first prove (1) for j = t when Q is a path from the decomposition. Let 
W = (L(T) U S^ 2 (T) U F(T)) n V(Q), where F(T) is the set of first vertices of 
maximal paths of T consisting of link vertices. As in the proof of Theorem 7, it 
follows from Facts 1 and 2 that \W\ < Ak. Obtain a digraph R by deleting from 
D[V(Q)] all arcs in which at least one end-vertex is in W and which are not arcs 
of Q. As in the proof of Theorem 7, it follows from Lemma 1 and 1-AE opti- 
mality of T that there are no forward arcs for Q in R. Let Q = V1V2 ■ ■ ■ v q . For 



T' = T[{v} U (U? =i nTi))] ^d T" = T[{v'} U (Ui =p+1 V (T,))} 



pw(UG{D[V{Q)])) < 2(t - j + 2.5)fc 



(1) 
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every j e [q], let Vj = {vi : i € [j]}. If for some j the set Vj contained k vertices, 
say {v[,v' 2 , ••• , v' k }, having in-neighbors in the set {vj + i,Vj +2 , ■ ■ ■ , v q }, then D 
would contain an out-tree with k leaves formed by the path Vj + iVj +2 ■ ■ ■ v q to- 
gether with a backward arc terminating at v[ from a vertex on the path for each 
1 < i < k, a contradiction. Thus vs(\JG(D 2 [P])) < k. By Proposition 2, the 
pathwidth of UG(i?) is at most k. Let (X\, X 2 , ■ . . , X s ) be a path decomposition 
of XJG(R) of width at most k. Then (X 1 U W, X 2 U W, . . . , X s U W) is a path 
decomposition of \]G{D\V{Q)\) of width less than k + Ak. Thus, 

pw(VG{D[V(Q)})) < 5k (2) 

Now assume that we have proved (1) for j = i and show it for j = i — 1. 
Let Q be a node of layer i — 1. If Q is a leaf of T, we are done by (2). So, we 
may assume that Q has children Q' and Q" which are nodes of layer i. In the 
/^-decomposition of T given before this theorem, we saw that the vertices of T' 
have at most A + 1 out- neighbors in T" and the vertices of T" have at most A + 1 
out-neighbors in T". Similarly, we can see that (in the /^-decomposition of this 
proof) the vertices of Q' have at most k out-neighbors in Q" and the vertices 
of Q" have at most k out-neighbors in Q' (since k < A — 1). Let Y denote the 
set of the above-mentioned out-neighbors on Q 1 and Q"; \Y\ < 2k. Delete from 
D[V(Q') U V{Q")\ all arcs in which at least one end-vertex is in Y and which do 
not belong to Q' U Q" 

Let G denote the obtained digraph. Observe that G is disconnected and 
G[V(Q')} and G[V(Q")} are components of G. Thus, pw(\JG(G)) < 6, where 

b = max{pw(VG(G[V(Q')])),pw(VG(G[V(Q")}))} <2(t-i + 4.5)fc (3) 

Let (Z±, Z 2 , . . . , Z r ) be a path decomposition of G of width at most b. Then 
(Z 1 UY,Z 2 UY,...,Z r U Y) is a path decomposition of VG(D[V(Q') U V(Q")}) 
of width at most b + 2k < 2(t-i + 2.5)k. □ 

Similar to the proof of Corollary 1, we obtain the following: 

Corollary 2. For a strongly connected digraph D, the problem k-DMLOB can 
be solved in time 2°( fclo s 2fc ) . n °W . 

6 Discussion and Open Problems 

In this paper, we initiated the algorithmic and combinatorial study of the Di- 
rected Maximum Leaf Out-Branching problem. In particular, we showed 
that for every strongly connected digraph D of order n and with minimum in- 
degree at least 3, £ S (D) = ^(n 1 / 3 ). An interesting open combinatorial question 
here is whether this bound is tight. If it is not, it would be interesting to find 
the maximum number r such that i s (D) = Q{n r ) for every strongly connected 
digraph D of order n and with minimum in-degree at least 3. It follows from our 
results that \ <r <j. 
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We also provided an algorithm of time complexity 2°( fclog k "> ■ which 
solves the /c-DMLOB problem for a strongly connected digraph D. The algo- 
rithm is based on a combinatorial bound on the pathwidth of the underlying 
graph of D. Instead of using results from Section 5, one can use Bodlacndcr's 
algorithm [9] computing (for fixed k) tree decomposition of width k (if such a 
decomposition exists) in linear time. Combined with our combinatorial bounds 
this yields a linear time algorithm for /c-DMLOB (for a strongly connected di- 
graphs). However, the exponential dependence of k in Bodlaender's algorithm is 
c k ' for some large constant c. 

Finally, let us observe that while our results are for strongly connected di- 
graphs, they can be extended to a larger class of digraphs. Notice that £(D) > 
£ S (D) for each digraph D. Let C be the family of digraphs D for which either 
£ S (D) = or £ S (D) = 1(D). The following assertion shows that C includes a 
large number digraphs including all strongly connected digraphs and acyclic di- 
graphs (and, also, the well-studied classes of semicomplete multipartite digraphs 
and quasi-transitive digraphs, see [7] for the definitions). 

Proposition 3 ([5]). Suppose that a digraph D satisfies the following property: 
for every pair R and Q of distinct strong components of D, if there is an arc 
from R to Q then each vertex of Q has an in-neighbor in R. Then D G C. 

Let B be the family of digraphs that contain out-branchings. The results 
of this paper proved for strongly connected digraphs can be extended to the 
class C fl B of digraphs since in the proofs we use only the following property of 
strongly connected digraphs D: £ S (D) — £(D) > 0. 

For a digraph D and a vertex v, let D v denote the subdigraph of D induced 
by all vertices reachable from v. Using the 2°( kl ° 6 fc ) • algorithm for k- 

DMLOB on digraphs in C fl B and the facts that (i) D v £ C fl B for each digraph 
D and vertex v and (ii) £(D) = m&x{£ s (D v )\v € V(D)} (for details, see [5]), we 
can obtain an 2 ( fcl ° s fc ) • n 0( -^ algorithm for fc-DMLOT on all digraphs. For 
acyclic digraphs, the running time can be reduced to 2 ( fclogfc ) • n ^ 1 ). 
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